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ABSTRACT 

Let V. (n) be the minimum number of binary comparisons that 
are required to determine the i-th largest of n elements drawn 
from a totally ordered set. In this thesis we use adversary 
strategies to prove lower bounds on V.(n). For i=3, our lower 
bounds determine V_(n) precisely for infinitely many values of n, 
and determine V_(n) to within 2 for all n. For a general fixed 
i, our lower bound has the asymptotic form n + (i-l)logn - 
O(log(log*n)) where log*n is a very slowly growing function. As a 
result, the asymptotic behavior of V.(n) is determined to within 
O(log(log*n)). A more general problem is raised in which one wants 
to find an element which is (i,j) -mediocre, i.e., smaller than at 
least i elements and greater than at least j elements. For i=l, 
it is shown that the best algorithm is to select the i+lst 
largest of any subset of i+j+1 elements. It is an interesting 
question whether for general i this procedure is also optimal for 
finding an (i,j) -mediocre element. An affirmative answer to this 
question would imply V / ? (n) ^ 3n . 



CHAPTER 1 
PRELIMINARIES 

1.1 Introduction 

In this thesis we are interested in the problem of determining 
the i-th largest element of a totally ordered set of n objects. 
We shall concentrate on selection methods in which only pairwise com- 
parisons are allowed. 

The history of this problem dates back to 1883 when Lewis 
Carroll in an essay [2] pointed out that the usual playing procedures 
of a "knockout" tennis tournament would in most cases select the 
wrong second and third best players. Therefore in the essay he set 
out to devise a plan for finding the true second and third best 
players. There are of course many ways to achieve this goal, and a 
question that naturally arises is "In how few matches (assuming 
transitivity and antisymmetry) between n players can one decide 
the i-th best player?" Let V.(n) be the answer to this problem. 
The other closely related problem is to select in order the first, 
second, ..., and the i-th best players, and the minimum number of 
matches required here we denote by W.(n). 

Some obvious properties of V and W are: 
(i) V.(n) < W.(n) 
(ii) V^n) = V n _ i+1 (n) , W i (n) = W n _ i+1 (n) (by symmetry) 



(iii) W.(n) s riog(i!)1, an information theoretic lower bound. 

(All of our logarithms are taken to the base 2.) 
(iv) V 1 (n) = W 1 (n) = n - 1 (cf. also Sec. 1.4) 

In 1932 Schreier[7] gave an algorithm for finding the first and 

second best of n players that requires at most n - 2 + Tlognl 

matches. This construction was generalized by Kislitsyn[3] in 1964 to 

show that 

i-2 
W.(n) <; n - i + £ riog(n-k)1 for all i. (1) 
1 k=0 

His algorithm uses first i stages of "tree selection" sort. (cf. 
Knuth[4], Sec. 5.2.3) We first set up a knockout tournament of n 
players, and determine the best player in n-1 matches. We then 
"output" the champion and select the best of the remaining players. 
After repeating this procedure i times, we will have found the first, 
second, ..., and the i-th best players. Note that after the initial 
tournament is set up, at each later stage only a portion of the tour- 
nament needs to be reconstructed in order to find the new champion. 
One can in fact show that the first i stages require at most the 
number of matches as shown on the right hand side of (1). 

If only the i-th best player is desired, Hadian and Sobel[5] 
proved that 

V.(n) <: n - i + (i-1) flog(n-i+2)1 for all i. (2) 



Their construction is as follows: One sets aside a set S of i-2 
players and perform a knockout tournament on the remaining n-i+2 guys. 
(This requires n-i+1 matches.) The champion of this tournament ranks 
higher than n-i+1 players, and hence is too good to be the i-th 
best player. We replace him by a player from S. One rebuilds 
part of the tournament as needed to determine the new champion (which 
requires at most riog(n-i+2jl matches). Again, we replace the cham- 
pion after finding him. After i-1 passes, we have successfully 
eliminated i-1 players that are overqualified. Now the champion 
of the remaining n-i+1 players in the tournament must be the true 
i-th best player. (The last step takes flog(n-i+2)~l - 1 matches.) 
Summing the matches, we get Equ.(2). 

For small i, both of the algorithms described above seem to be 
rather efficient. However, they require O(nlogn) comparisons when 
i = 0(n). For a while it was not clear whether any efficient algori- 
thms might exist in the case of large i. Finally, in 1971 Blum, 
Floyd, Pratt, Rivest, and Tarjan[l] proposed a uniform procedure 
which can find the i-th largest of n elements in 5.73n compari- 
sons for all i. 

A very challenging task in the theoretical study of algorithms 
is to prove lower bounds on the complexity of a given problem. Until 
recently little was known about lower bounds in the tennis tournament 
problem. The first step was made in 1964 by Kislitsyn who showed, in 



the same paper cited above, that n - 2 + flognl is in fact the mini- 
mum number of comparisons required by any algorithm to find the largest 
and the second largest elements. Therefore, Schreier 1 s construction 
gives an optimal algorithm. In 1971 Blum et al[l] proved that 
n + i + Tlognl - 4 is a lower bound for selecting the i-th largest 
element where i ^ n/2. Recently this result was further improved by 
Pratt [6] to give 

V.(n) 2: n + 2i - logn for i £ n/3, 
and V.(n) 2: (3n+i)/2 - logn for n/3 <: i <. n/2. (3) 

Impressive as this bound is, it still leaves a considerable gap be- 
tween upper and lower bounds when i is small. As a matter of fact, 
the asymptotic behavior of the Hadian-Sobel upper bound (2) is 
~ n + (i-l)logn while that of the previous lower bound is ~ n + logn. 
It is therefore an intriquing problem to determine what the true 
asymptotic behavior of the optimal algorithm is. 

In this thesis, we present lower bounds for the selection of the 
i-th largest element and the selection of the largest i elements (in 
order). For i small relative to n, the gap between upper and lower 
bounds for either problem now has the leading term (i-1) (i-3)log(log*n) 
where log*n is a very slowly growing function, (cf. its definition 
in Sec. 3. 2) In the case of i = 3, we have a tighter lower bound 
which actually determines V_(n) precisely for infinitely many values 
of n ; and determines V,(n) and W_(n) to within 1 or 2 for all 
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1.2 Tree Algorithm 

Let x, , .... x be n distinct numbers. To select the i-th 
1 n 

largest of x. , . . . , x is to determine x , l£r£n, such that 

£ *k I *k > X r ? * ± " 1 and f *k | *k < X r^ ^ « - i • 

We shall write F. (x.. , .. . , x ) = r to denote that x is the i-th 
1 1 n r 

largest of x^ . . . , x . To sort Xp .... x^ , then, is to compute 

F (x, , .... x ) simultaneously for Lsi^n — or equivalently, to 
i 1' n 

determine a permutation p(l)p(2)« "p(n) such that * n\ > x p (2) > 

••• > x , v . We write G(x, , ..., x ) = p for the permutation p 
p(n) 1' n 

which satisfies this condition. 

It is quite obvious that if x 1 , ..., x and y 1 , ..., y n are 
two sequences each consisting of n distinct numbers, with the pro- 
perty that (Vj)(Vk)[ x. < x^ <* y. < y fc ] , then we must have 
F i (x 1 , ..., x n ) = F i (x 1 , ..., x n ) for all i, and G( Xl , ..., x n ) = 
G(y-, ..., y ). In other words, for operations such as sorting or 
selecting the i-th largest element, the answer depends solely on the 
ordering relation between the input elements. Hence, for this class 
of problems, it is natural to study computing techniques that are 
based entirely on pairwise comparisons between the input items. 

An algorithm that satisfies the above constraint can be repre- 
sented by a binary tree structure, such as that shown in Figure 1, and 



will be called a tree algorithm . Each internal node contains two 

indices "i:j", denoting a comparison of the i-th input x.^ versus the 

j-th input x.. For simplicity, we shall always assume that all inputs 

are distinct, so that each comparison may have two outcomes. The left 

subtree of this node then represents the subsequent comparisons to be 

made if x < x , and the right subtree describes the succeeding moves 
i J 

when x. > x.. Since we will be interested in minimizing the number 




Figure 1 A tree algorithm for selecting the 
second largest of four elements 



of comparisons needed, we may assume that no redundant comparisons 
occur in a tree algorithm. Thus the left and right subtrees of an 
internal node must both be accessible, and hence all external nodes 
are reachable from the root. If this algorithm is to compute the i-th 
largest of n items, then each external node of the tree shall con- 
tain an index r denoting the fact that F^Xp ..., x^) = r has 
been established as a result of the comparisons made along the path 
from the root to this node. Similarly, for sorting algorithms each 
external node shall contain a permutation which equals GCx.^, ..., x n )- 

As a complexity measure, we shall define the cost of a tree 
algorithm to be the maximal path length of its comparison tree. 
Thus, the functions V.(n) and W.(n) that are informally defined 
in the last section can be expressed as 

V.(n) = min [cost of A \ A is a tree algorithm that computes 

F jL (x 1 , ..., x^)} 

W.(n) = min {cost of A | A is a tree algorithm that computes 

F.(x n , .o., x ) simultaneously for l^j^i } 
i 1 n 



10 
1.3 The Concept of a Strategy 

The cost of a tree algorithm can be considered from a game-theo- 
retic point of view. This approach, as we shall see, enables us to 
obtain tight lower bounds on the minimum number of comparisons required 
to do various selections. 

For example, we may look at the computation of the i-th largest 
of n items as a two-person game in the following way. There is 
player A who makes the opening move, and whose move shall always be an 
inquiry of the form "Is x. less than x.?". The adversary B, in 
his turns, shall give a reply of either "yes" or "no" to the ques- 
tion that A just asked. The game ends when A can successfully deter- 
mine the i-th largest element. The payoff of B in this game is 
defined to be the total number of questions asked by A. 

Now, recall in game theory, a player's "strategy" is simply a 
complete specification of what he would do in each situation that 
might arise in the play of the game. In the game described above, any 
strategy that A may employ corresponds to a tree algorithm for select- 
ing the i-th largest element, and vice versa. Once A has selected a 
strategy (i.e., a tree algorithm is given), the choice of a strategy 
on B's part will then completely determine the path that the computa- 
tion is going to follow in the comparison tree of this algorithm. 
Therefore, if we can show that for any strategy A may use, B can 
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always find an "adversary strategy" that quarantees a payoff of at 
least L.(n) for B, then we in fact will have proved that 

V (n) s L (n). 



Notice that, as far as proving lower bounds for tree algorithms 
is concerned, adversary B has the privilege of consulting the strategy 
which A has chosen while he makes up his own. However, all the adver- 
sary strategies that are considered in the following chapters do have 
uniform prescriptions, i.e., decisions are solely based on previous 
moves and do not depend on the knowledge of A's strategy. 

In giving descriptions of strategies and situations that might 
arise in a computation, it is convenient to use the terminology of 
tennis tournaments. After all, this is where the selection problem 
originated! So, let us regard each itiput item as a tennis player. 

A comparison between x. and x. then becomes a "match" of x. 

J- J i 

versus x.. If the outcome is x. < x., we say "x. is defeated by 

J i 3 i 

x.": if x. > x., we say "x. beats x." etc. (Here we have to 
assume that tennis skill satisfies transitivity, i.e., if x. beats 
x. and x. beats x, , then x. would beat x, .) A sequence of matches, 
each tagged with its outcome, is called a tournament , usually denoted 
by T. The length of a tournament T, written as |T{, is the total 
number of matches T contains. Thus the payoff of adversary B corre- 
sponds to the length of the resulting tournament. In connection with 
a tournament T, we shall let time t refer to the point when the t-th 
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1.4 Crucial vs. Noncrucial Comparisons 



With respect to a tournament T, let us write x. < x^ if in the 
final ranking x. is determined to be dominated by x,. We write 
x. <: x_ if either x. < x^ or j = k. It is easy to see that 

1) if x, < x , then in some match x, must be defeated by an x 
such that x. ^ x . 

2) if x < x,, then in some match x.must beat an x. such that 

x ^ x. . 

r J 

Now, if T selects the i-th best player, there must be a player x^_ 
such that 

K 3 ^ I X k <X r ] l = n " i and i fx kl X r <X k } l =i " 1 

Hence for each player other than x we must be able to isolate a 

match of either type 1) or 2), which we call a crucial match. Thus 

T must contain n-1 crucial matches. Alternatively, if we look at 

the Hasse diagram that represents the partial ordering of the players 

as established by T, n-1 crucial matches are those required to form 

the links of a spanning tree on the n nodes, with x looking like a 

"bottleneck", (cf . Figure 2) Thus we know that 

V.(n) £ n - 1 for all i. 
i 

Incidentally, this proves V, (n) = n - 1 since it is easy to see that 
V 1 (n) ^ n - 1. For general i, in order to improve this estimate on 
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the lower bound of V.(n), the adversary must try to entrap A into 
making a large number of comparisons that are noncrucial . 




Figure 2 Crucial matches (solid lines) and noncrucial matches 
(dotted lines) for selecting x as the fourth best 
of 12 players „ 



15 

CHAPTER 2 

SELECTION OF THE 
THIRD BEST PLAYER 

2. 1 Introduction 

The adversary approach, as introduced in Sec. 1.3, was used by 
Knuth (Sec. 5.3.3 of [4]) in proving Kislitsyn's theorem that n - 2 + 
T lognl comparisons are indeed necessary for finding the second largest 
of n items. In this chapter, we present Basic Strategy which is a 
slight modification of the strategy Knuth used. We shall first use 
Basic Strategy to reconstruct Knuth' s proof, and then proceed to estab- 
lish lower bounds for V,(n) and W (n). Lemma 1 describes a specific 
situation that must arise in any computation controlled by Basic Strategy. 
By taking advantage of this situation, one can create a large number of 
noncrucial comparisons, and lower bounds for V 3 (n) and W 3 <n) are obtained 
thereby. As a consequence, V (n) is determined precisely for infinitely 
many values of n, and to within 1 or 2 for general n. W 3 (n) is also 
determined to within 1 or 2 for all n. 
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2.2 Weight function and Basic Strategy 

In Sec. 1.3 we introduced the notion of using strategies to 
establish lower bounds for selection problems. The Basic Strategy, to 
be defined below, shall play an essential role in the forming of our 
final strategies for the adversary. It is a modification of the strategy 
Knuth used in proving Kislitsyn's lower bound onV 2 (n). First we 
have to define a weight function: 

Definition 1 With respect to a tournament T, Q (x) where x is any 
player and O^t^JTJ , is a positive integer known as x's weight at 
time t , and is defined recursively as follows: 
Ql. Q Q (x) = 1 for all x . 
Q2. If x inflicted y's first loss in the t-th match of T, then 

Q t (x) = Q t _ 1 (x) + Q t _ 1 (y) , a n d Q t («) = Vl (z) f ° r a11 Z 
such that z ^ x . 

Q3. If the loser of the t-th match was previously beaten, then 

Q t (x) = Q t _ 1 (x) for all x . 

We can now present 
Definition 2 Basic Strategy 
Assume x plays y in the t-th match of T: 

BSl. If x is yet undefeated and y is not, then let x win. 
BS2. If both x and y are undefeated and Q .(x) > Q . (y) , 

then let x win. 
BS3. In other circumstances, let the outcome be arbitrary. 
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Definition 3 A tournament is said to be BS-ruled if the outcomes of 
the successive matches are decided by Basic Strategy. 

The following are some central effects of our Basic Strategy. 
Fact 1 In a BS-ruled tournament, once a player is defeated, his 
weight does not increase from then on. 

Proof By the definition of Q (x) , a player's weight increases only 
when he makes a first loss. From BSl, a player who has been defeated 
cannot inflict any first loss, therefore his weight shall stay the 
same. 

Fact 2 In a BS-ruled tournament, a player's weight is at most 2 
after he makes k first defeats. 

Proof A player's weight increases only with every first defeat he 
makes. By BS2 and Q2 , his weight at most doubles when he does 
make a first defeat. Since the initial weight is 1, Fact 2 is 
proved by induction on k. 

In the definitions of weight and Basic Strategy, only a player's 
first defeat is relevant. Indeed, we can represent the players' 
weights in a BS-ruled tournament T by a tree structure, called the 
first-defeat-tree (abbr. FDT) of T. Every player is represented by 
a node in the FDT, and y is a son of x iff x inflicted y's 
first loss in T. Thus the FDT may be a disjoint union of several 
rooted trees, with every root representing an undefeated player. (cf. 
Figure 3) In view of Fact 1, we shall let Q(x) denote the 



maximum 
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value of x's weight in a BS-ruled tournament. Then we have 
Fact 3 Given a BS-ruled tournament T, let D(x) be the set of x's 
descendants (including x itself) in the FDT of T. Then Q(x)= |d(x)| 
Proof Argue by induction on the number of first defeats x makes. 
Use Q2 and Fact 1. (Details omitted). 

Thus, by Fact 3, Q(x) corresponds to the total number of nodes 
(or the actual "weight") of the subtree rooted at x in the FDT. For 
example, in Figure 3 we have Q(x)=10 and Q(y)=4. 





Figure 3 First-defeat-tree of a BS-ruled tournament 
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It is now a simple matter to prove 
Theorem (Kislitsyn) V 2 (n) £ n - 2 + flog nl . 

Proof Given any tree algorithm for selecting the second best player, 
we apply the Basic Strategy to it. Note that the champion is also 
determined in the resulting tournament T (he is the singular player 
dominating the selected second best), call him X £ . Since he must 
be the only undefeated player in T, in the FDT of T he is hence the 
unique root. By Fact 3, we have Q(x ) = n . Therefore x c must 
have made at least Tlog nl first defeats in T. However, at most one 
of these could be a crucial match (cf. Sec. 1.4) as far as selecting 
the second best player is concerned. Since there must be n - 2 
other crucial matches, we conclude that T 2: n - 2 + Flog nl . This 
proves that V ( n) £ n - 2 + flog nl . 
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2. 3 Nearly Exact Bounds for i=3 



In this section, we proceed to prove fairly tight lower bounds for 
V-(n) (the minimum number of comparisons for selecting the third largest 
element), and W (n) (that for selecting the first, second, and third 
largest elements in order) . 



2.3.1 Main Theorem 

The main result of this section is the following theorem: 

Theorem 1 V (n) ^ H(n), where for k :> 1, 

,n - 3 + 2 TlogC n-l)1 if n = 2 k + 1 

H(n) =<n - 4 + 2riog(n-l)1 if 3«2 k_1 < n £ 2 k+ 

^n - 5 + 2riog(n-l)"l if 2 k + 1 < n s; 3* 2 k ~ 

By comparing H(n) with Hadian and Sobel's upper bound (cf. Sec. 
1.1) V (n) <: n - 3 + 2Flog(n-l)1 we see that V (n) is determined 
precisely if n = 2 + 1 ; and is determined to within 1 or 2 for 
general n . Since W (n) £ V ( n) ^ H(n) , comparing H(n) with 
Kislitsyn's upper bound W„(n) ^ n - 3 + Tlog nl + TlogCn-l)! shows 
that W (n) is determined to within 1 or 2 for all n . 

The following lemma is the key to the proof of Theorem 1. To 
simplify notation, let us define 

h(n) = H(n) - n + 1 . 
It may be helpful to keep in mind that h(n) is an estimate of the 
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number of noncrucial (i.e., wasted) comparisons that the adversary 
is aiming to create. 

Lemma 1 Any BS-ruled tournament that selects the third best of n 
players must reach a point in time t when one of the following two 
situations arises: 

(1) There exist 3 undefeated players A, B, and C; A and B toge- 
ther have made at least h(n) first defeats so far. 

(2) There exist 2 undefeated players A' and B' who together 
have made at least h(n) +1 first defeats so far. 

We first show how Theorem 1 can be proved by using Lemma 1. The 
proof of Lemma 1 will be given in Sec. 2.3.2. 

Proof of Theorem 1 We present a strategy which, when applied to any 
algorithm for selecting the third best player, will result in a tour- 
nament that contains at least H(n) matches. The strategy has two 
phases: 
Phase I Basic Strategy 

Follow Basic Strategy until either situation (1) or situation 
(2) of Lemma 1 occurs. At this point t , switch to 
Phase II Clear Strategy 

CSl. As a follow-up strategy for situation (1), let A, B, C 
always win when they play other players. Among A, B, C we shall 
assign the order A > C, B > C . In other cases we do not care. 
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CS2. As a follow-up strategy for situation (2), let A', B' 
always win when they play other players. In other cases, we do not 
care. 

Now, in the case of situation (1) followed up by CSl, player 
C will necessarily be selected as the third best player, dominated by 
A and B. But then, none of the h(n) matches as mentioned in 
situation (1) is a crucial match as far as selecting the third best 
player is concerned. Hence the entire tournament must contain at 
least n - 1 + h(n) = H(n) matches. 

Similarly, in the case of situation (2) followed up by CS2, 
A' and B f necessarily become the two top winners of the tournament. 
Among those h(n) + 1 first defeats made by A' and B' before time t, 
one could be the first defeat of the third best player. However, this 
is the only case where those h(n) +1 matches may contain a crucial 
one. Since there ought to be n - 2 other crucial matches, we see 
that the length of the tournament is at least n - 2 + (h(n) + 1) , 
which equals H(n) . Theorem 1 is thus shown to follow from Lemma 1. 

2.3.2 Proof of Lemma 1 

It is clear that any BS-ruled tournament that selects the third 
best player must fall into one of the following two classes: 
(i) Those in which there is only one undefeated player left at the end. 
(ii) Those in which there are two undefeated players left at the end. 
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For the proof of Lemma 1, we need only consider case (i). Indeed, 
suppose the lemma is proved in this case. Let T be a tournament of 
class (ii), so that two undefeated players say x and y are found at 
time |t|. If Q. . (y) £ Q| T .(*), we may extend T by letting x play 
and defeat y in an additional match. Call the resulting tournament 
T 1 . Clearly T 1 has been following Basic Strategy, and moreover is in 
class (i), so the lemma is true for T'. However, the particular time t 
in T' as characterized by the lemma must satisfy t ^(T|, since at time 
|t|+ 1 there is but one undefeated player x, not satisfying either 
assertion (1) or (2). Hence the lemma must be in fact true for T. 

Let us now look at the first-defeat-tree of a tournament of class 

(i). (cf. Figure 4) The only undefeated player, namely the champion, 

is denoted by x . Let x„, x, , ..., x be the sons of x arranged 
J c 1s c 

in the order their first defeats by x took place. Thus if t. , 

c j 

O^j^s , is the time x inflicted x.'s first loss, then t < t.< ••• 

< t . For O^i^s, let d. denote the number of first defeats made by 
s J 

x.. (By BSl, they must all take place before time t..) 

Claim Let M = max {j+d.lo^j^s} 

and I = min { j I <: j <: s, j +d. = M }, then 

(I) if & < s , we must have M ^ h(n). 

(II) if H = s , we must have M s h(n) +1 . 

Note that once Claim is proved, Lemma 1 will follow immediately. 
For, in case (I) is true, we can choose t to be the time t.- 1 . 
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At this moment, x , x^ and x^ +1 (note i+1 £ s ) are all undefeated 

since t -1 < t „ < t „ „ . By this time x has made i first defeats 

(namely on x , x^ ..., x^), and x^ has made d^ . Since by (I) 

I + d. = M 2> h(n), assertion (1) of Lemma 1 is fulfilled if we choose 

A. B, C to be x , x , and x„ , . On the other hand, if (II) of Claim 
' ' c' i' Z+l 

is true, we can choose t to be t - 1. At this time x and x are 

undefeated, and have made a total of s + d = M S h(n) + 1 first de- 
' s 

feats. Thus choosing A' and B' to be x and x will satisfy asser- 

c s 

tion (2) of Lemma 1. 




Figure 4 FDT with a single root 
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So we have reduced the proof of Lemma 1 to the verification of 
cases (I) and (II) of Claim. 

Proof of Claim 
(I) Assume I < s . 

By the definition of M, we have j + d. £ M for all O^j^s. In par- 
ticular, d TM/2l ^ M " fM/2 ^ = I-M/2J 

d rM/2l+l S M " ( fM / 2l+ V = LM/2J - 1 



d s; M - s 
s 

d. 

By Fact 2, Q(x.) <: 2 J for all 0<rj<:s. Hence we have 

«<*r„ /2 i> - * LM/2J 
«<*r H/2 W^ LM/2j - 1 



<!(*,) s 2 M " S 
Summing the weights, we get 

Z QC.) * 2^ + 2 LM/2J-1 + ... + 2 1 + 2 

Also, x has only made ) first defeats prior to time t., hence by 

° " J 

Fact 2 of Sec. 2.2 has weight at most 2 J before he plays x.. This 

implies that Q(x.) £ 2 J , for otherwise x. would not lose to x by 
J J c 

BS2 of Basic Strategy. In particular, this is true for x. such that 

.1 
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OsCj<rM/2l . Therefore, 

J "° ' . 2 fH/21 - 1 (3) 

On the other hand, since in the first-defeat-tree the union of the 

subtrees rooted at x., (Kj^s, contains every node except x , by 

J c 

Fact 3 of Sec. 2.2 we have 

s 

£ Q(x.) = n- 1 (6) 

.1=0 J 

Now, from Equs. (4), (3) and (6), we obtain 

Tm/21 IM/2J+1 

2 +2 LJ - 2 2j n - 1 

Solving this inequality for M and compare with the definition of 

h(n), we conclude that 

M £ h(n) . 

(II) Assume i = s . 

By the definition of i, we have M = s + d but j + d. < M for all 



s J 



* 



j<s. Hence, d TM/2l < M ~ l " M/2 " 1 = ^ M/2 J 

d r M / 2 i + i < L M/2 J - l 



d , < M - s + 1 
s-1 



d = M - s 
s 



Again, by Fact 2 we have 

£ „, N ,. J M /2j-l „M-s M-s 

E Q/x.) £ 2 LJ + ••• + 2 +2 

j = TM/2l J 



2 Lm/2J (7) 
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It then follows from E*tw, (3), («), (7) thst 

As a result, w* get 

M * h(n) + l , 
Thus, Clslw has bsaa provsd, sod ths srwf of 



|l^^^s«ai»aMI!»MjiS4^s»l- **«*• -r. 



$MMMsv 1 Is bow consists. 



-^ . , <•«. , _»_ 



:;- a ■?»-'"«*£ ©sis to tjc. _* _ t - „ 






'#& *rt 



^^>;<^S> 



is*© lO :. 






28 

CHAPTER 3 
GENERAL LOWER 
BOUNDS FOR SMALL i 

3.1 Introduction 

In this chapter, we shall generalize the techniques of 
Chapter 2 to deduce lower bounds on V.(n) and W^n) for snail i. 
Although we appeal to the same scheme of using a two-phase strategy, 
the analysis involved for i > 3 is considerably more difficult. 
The major effort is contained in the proof of Lemma 2, which is 
a generalization of the analysis we did on "first-defeat-trees" 
in Lemma 1 of Sec. 2. 3. 

For any fixed i, our lower bound for V.(n) and W^n) has the 
asymptotic behavior n+ (i-l)logn - O(log(log*n)) . By comparing it 
with upper bounds as given by Equs.(l) and (2) of Sec. 1.1, we see 
that the asymptotic behaviors of both V.(n) and W.(n) are deter- 
mined to within a term of order O(log(log*n)) . 
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3.2 Main Theorem 

We shall derive lower bounds on V.(n) and W.(n) for i > 3. 

i l 

by induction on i , based on the result established for i = 3 in 
the previous chapter. The resulting formulas for both of our lower 
bounds involve a quantity which is dependent on i as well as on n. 
This quantity, formally defined below as h.(n), is fairly close to 
log(n) when i is small relative to n . 

Notation For n&2, let log*n denote the largest integer k such 



that 2 



„*2 2 



•• K 



Thus, log*2 = log*3 = 1 , log*4 = log* 5 = ••• = log*15 = 2 , and so 
forth. 

Definition 4 For any i S 3, define h.(n) by the following formula: 

h 3 (n) = logn-2; \(8) 

for i > 3, h.(n) = logn - (i-3)log(log*n) - 21og((i-l)l) - (i-1) J 

Our main results of this chapter are contained in the following 
theorem: 

Theorem 2 For any n and i such that n > i £ 3, we have 

V,(n) 5s n - £ + (l-l)(h.(ti) - 1) (9) 

1 l 

W 1 (n) £ n - i + (l-l)h (n) (10) 
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Remark 1 If we define h-^n) = h 2 (n) = Tlognl , then the lower bounds 
as given by Theorem 2 also hold when i=l, 2. In fact, they are close 
to the precise bounds that are known in these cases: 



V 1 (n) = W x (n) = n - 1 

V„(n) = W_(n) = n - 2 + Tlognl (Kislitsyn's Theorem) 



Remark 2 For i=3, Theorem 2 gives the following lower bounds: 



V 3 (n) s: n - 3 + 2(logn - 3) 

= n - 9 + 21ogn, , 

W (n) s n - 3 + 2(logn - 2) 

= n - 7 + 21ogn . 



These are slightly worse than the lower bounds we actually proved for 
i=3 in Theorem 1 of Chapter 2. The reason is of course that here we 
would rather use a varied (and slightly weakened) form of Lemma 1, if 
only it could serve more conveniently as a basis for inductive argu- 
ments, (cf. Lemma 2 and its proof in Sec. 3. 4. 2) 

Remark 3 Compare the lower bounds of Theorem 2 with the upper 
bounds of Hadian-Sobel and Kislitsyn respectively. From (8), we have 



h^n) > Tlognl - (i-3)log(log*n) - 21og((i-l)l) - i 



since log + 1 > Tlognl . Hence by comparing Equs. (9), (10) with 
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(1) and (2) of Sec. 1.1, we see that the gap between upper and lower 
bounds on V. (n) and W.(n) is now less than 



(i-l)(i-3)log(log*n) + (i-l)[21og((i-l)!) + i + 1} • 



As a consequence, for any fixed i > 3, the asymptotic behaviors 
of V.(n) and W.(n) are determined to within 



(i-l)(i-3)-log(log*n) +0(1) . 

Remark 4 For large n and i, since 

log((i- )!) « log(i!) « i(logi) , 

we see from the definition of h.(n) in (8) that 
h.(n) « logn - i(21ogi + log(log*n)). 



Hence h.(n) approaches zero at i « ( logn) /(21og( logn)), 



Remark 5 When h.(n) gradually approaches zero or even becomes 
negative as i grows larger and larger, formula (9) is superseded by 
Pratt's lower bound as given by Equ.(3) of Sec. 1.1: 

V.(n) £ n + 2i - logn for i <i n/3 , 

V.(n) s (3n+i)/2 - logn for n/3_:<: i <; n/2 . 

Also, (10) may be replaced by the following lower bound on W (n) : 
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Theorem 3 For any n and i such that n ^ i, 
W.(n) ^ n - i + riog(i!)l . 

Proof This is purely by information- theoretical arguments. Suppose 
in a tournament A. , . . . , A (unordered) are known to be the best i 
players, and the order of the remaining n-i players is fixed. 
Then, just in order to completely determine the ranking of these i 
top players, flog(il)l matches between them could be necessary in the 
worst case. However, for each of the remaining n-i players, there still 
must be a match in the tournament in which he lost to someone whose 
ranking is no higher than the i-th. Hence these n-i "crucial" 
matches are completely disjoint from those riog(i!)l mentioned 
before, and this tournament contains a total of n - i + Tlog(il)l 
matches at least. 
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3.3 Proof of Main Theorem 

The proof of Theorem 2 depends on the following lemma the way 
Theorem 1 of Sec. 2.3 depends on Lemma 1. Indeed, Lemma 2 gene- 
ralizes the analysis of the first-defeat-tree as was done in 
Lemma 1. Here a "championship" tournament is a tournament which 
determines, among other things, the best player of all. 

Lemma 2 Let T be a BS-ruled championship tournament of n players. 
For any i such that i s 3, we assert that there must exist a time 
t in T when the following two statements are true: 
(i) There are ^i-1 undefeated players who have made a total of 

(i-l)h (n) first defeats at least. 
(2) The champion himself has made at least h.(n) first defeats. 
(The second statement serves mainly as an induction hypothesis that is 
needed for the proof of Lemma 2. ) 

As in Sec. 2.3, let us first assume that the lemma holds and 
prove Theorem 2. The proof of Lemma 2 will be given in Sec. 3.4 . 



Proof of Theorem 2 



Given an algorithm for selecting the i*th best pLayer, we can 
find the champion by using at most V-(i-l) = i-2 additional matches 
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to determine the best among the top i-1 players. This means that if 
V!(n) is the minimum number of comparisons for selecting the largest 
and the i-th largest element, then 



V (n) ^ V[(n) - (i - 2) 



and W j .(n) :> V|(n) 



Thus, to prove Theorem 2, it suffices to show that 



V!(n) :> n - i + (i-lOh^n) . 



Given an algorithm for selecting the champion and the i-th best 

player, we first apply Basic Strategy to it. By Lemma 2 there must be 

a crucial moment t when there are j ( l^j^i-1 ) undefeated players 

A A who have made a total of (i-l)h (n) first defeats at 

1 j L 

least. As soon as this occurs, we shall switch to a Clear Strategy. 
Under the latter strategy, A.^ ..., A shall always win when they 
play any of the remaining n - j guys; in other circumstances the 

outcome may be arbitrary. Now, A x A necessarily turn out to 

be the j highest ranking players of the tournament. However, as far 
as determining the i-th best player is concerned, V (n-i+1) = n - i 
matches have to be played between the n-i+1 low ranking players in 
order to determine the best among them. Since these players are com- 
pletedly disjoint from the j highest ranking players, we see that 
the tournament must contain no fewer than n - i + (i-l)h (n) matches 
all together. 
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3.4 Proof of Lemma 2 

3.4.1 Auxiliary Propositions 

Lemma 2 will be proved by induction on i. In trying to proceed 
from say i=q to i=q+l, the problem of delicate "timing" in the 
tournament turns out to be of vital importance. Propositions 1 and 2 
below are two instances of how one deals with this problem in the 
proof of Lemma 2. 

As in the proof of Lemma 1, we consider the first-defeat- tree of T, 
the given BS-ruled championship tournament. Let x , ..., x be those 
player's whose first defeats are made by x , the champion, at time 
t., ..., t , respectively with t. < t, < • • • < t . Also let d be 

■*■ S U 1 S j 

the number of first defeats made by x., O^j^s. (cf. Figure 4 of 
Sec. 2.3.2) Furthermore, for any x., O^j^s, let T. be the "subtour- 
nament" of T consisting of those matches (in the order they occur in T) 
represented by the edges of the subtree, rooted at x. , of the FDT. 
Let T! denote the subtournament consisting of those matches of T , ..., 
T 1 together with the first defeats of Xy .... x._. (also in the 
order they occur in T) . (cf. Figure 5) One can easily see that T. 
and T! , when considered as tournaments themselves, are both BS-ruled 

championship tournaments. Within T, both subtournaments T and T' 

j J 

are completed before time t. . 
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Figure 5 Sub tournaments T. and T| 

— e J J 



In terms of the notations defined in the last paragraph, we have 
Proposition 1 Suppose Lemma 2 is true for i = q ^ 3. If for a: 
given BS-ruled championship tournament T (of n players), there exist 
j and m satisfying the following conditions, then Lemma 2 is also true 
in T when i = q+1 : 
(i) 0<;j<:s and m <; n , 
(ii) Q(x.) ^ m and h (m) ^ h ,,(n) . 
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Proof Since x defeated x. at time t., by Basic Strategy we 
c j J 

must have 



V-^V s Q t.-i (x j ) • 



Since 



Q t .jOO = QOO > m > 
j 

this means that both tournaments T. and T'. involve at least m 

J J 

players. 



By our induction hypothesis, there exists a time t in T. when 

(1) some s q-1 undefeated players in T. have made a total of 
q J 

(q-l)h (m) first defeats at least; 

q 

(2) x. (the champion of T.) by this time has made at least h (m) 
first defeats. 



Similarly, there exists a time t' in T! when 

(l) 1 same as statement (1) , only T. is replaced by T| ; 

(2) 1 same as statement (2) , only x. is replaced by x . 

q q J c 



Now, if t f < t, then (2) 1 is also true at time t. Hence at 
' ' q 

time t, by (1) and (2) 1 , there are ^q undefeated players who 
have made a total of at least q*h (m) first defeats. Since by 

q 

assumption h (m) £ h ,-(n), this together with statement (2) above 

q q+i q 

show that both assertions of Lemma 2 are true in T for i=q+l . 
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On the other hand, if t<t', we will choose the time to be t'. 

Then statement (1)' and (2) above show that at time t 1 some £q 

q <J 

undefeated players have made a total of at least q*h (m) £ q»h ^(n) 
first defeats. Again, this together with (2)' show that Lemma 2 
is satisfied for i=q+l. 



Proposition 2 Suppose Lemma 2 is true for i = q ^ 3. If for a given 

BS-ruled championship tournament T of n players, there exists an m and 

q indices samed a, j,>-*-jj ■, , that satisfy the following 

conditions, then Lemma 2 is also true in T when i=q+l : 

(i) m^n, 0^a^j<j<...<j -| £ s and a ^ logn; 

(ii) Q(x. ) £ m for ls:£<:q-l , and a + (q-l)h (m) > *l* h q+ i( n ) • 



(cf. Figure 6) 




Figure 6 A tournament of Proposition 2 
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Proof Since Q(x. ) 5: m for Ki^q-1, this means that each of the 

H 

q-1 subtournaments T. , ..., T. involves at least m players. 

J l J q-1 



Applying the induction hypotheses, we see that for l£j2,£q-l, 
there exists time t' , t' < t. , such that at time t* the follow- 
ing two statements are true: 

a 

(1) Some ^q-1 undefeated players in T. have made a total of 
q 3 A 

(q-l)h (m) first defeats at least. 

a 

(2) x. himself has made at least h (m) first defeats, 
q 3^ q 



There are two cases: 
(I) t* < t - 1 for all l^iq-1 . 

If this is the case, then at time t - 1 , x has made a first 

' a c 

defeats (namely on x_, ..., x ,). Also, since t. s t > t - 1 , 
y 0' ' a-1 ' i. a a 



I 



x. are all undefeated at time t - 1 . Hence, by (2) , we know 
that at time t - 1, x and x. for Is&Sq-l are q undefeated 
players who have made a total of a + (q-l)h (m) first defeats . 
Since by assumption 

a ^ logn , hence a > h . (n) , 
and a + (q-l)h (m) s q«h +1 (n) , 

we see that Lemma 2 is true for this tournament when i=q+l . 



(II) t' S: t - 1 for some I such that l^Kq-l . 

For this particular I , x at time t' has again made at least 

C Jo 

a first defeats. And by statement (1) , at time t' there are =£q-l 
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undefeated players who together with x have made at least 

a + (q-l)h (m) :> q'h (n) first defeats. Hence again Lemma 2 is true 

q q 

when i=q+l for this tournament. Thus the proof of Proposition 2 
has been completed. 

3.4.2 The Inductive Proof 

We are now ready to prove Lemma 2. 
A) Basis of induction, i=3. 

Let us look at Figure 4 of Sec. 2.3.2. Suppose Lemma 2 were 
false, we would have j + d. < 21ogn - 4 for all j such that 
logn - 2 £ j £ s . (If not so, we can choose the time to be t -1 where 
j £ logn - 2 and j + d. S; 21ogn- 4 = 2h_(n) . At this time t - 1 , 
both statements (1)„ and (2) of Lemma 2 are satisfied.) Thus, 

dp -| < 21ogn - 4 -(Tlognl - 2) <; Tlognl - 2 

^lognl-^ ri ° gn1 " 3 



d < 2riognl - 4 - s 
s 



From Fact 2 of Sec. 2.2, we have 

E Q(x.) £ 2 ri °S" 1 - 3 + 2 r l°S"l-4 + ... + 2 1 + 2° 

rxcgj-2 i ^r^i-z., (11) 

(Note, if flognl - 2 <■ 0, then h.(n) £ for all i 4 3, and Lemma 2 is 
trivially true in this case. Hence we may assume n > 4.) 



41 



From (ll)and Equs. (5), (6) of Sec. 2.3, we get 

riognl-3 s 

n - 1 = S Q(x.) + S Q(x.) 

j=0 J j = riognl-2 J 

^ 2 riognl-2 _ l + 2 riognl-2 _ % 

£ rx - 2 
which is a contradiction. This proves Lemma 2 for i=3. 



B) Suppose the lemma is true for i=q*:3, we will proceed to prove 
it for i= q+1 . 

If for the given tournament, s ^ q'logn , then at time say t , 

s-q 

x and x __ a+ i > x s _ +2' * ' * ' x -1 are ^ undefeated players where 
x alone has made s-q+1 first defeats (namely on x , ..., x ). 
Since s-q+1 > q'logn - q = q(logn - 1) S: q«h ,«(«) , Lemma 2 is 
true for i=q+l. Hence from now on we shall assume s < q'logn . 



Notation For 1 <; k <: log*n , let log v 'n denote log* "log n , 



Ck) 
and ±^ denote logn +(q-l)log v n. Also, let n*= n/(21og*n). 



We shall divide the problem up into log*n cases. Case k . for 
l^ksiog*n - 1, makes the assumption 

£ Q(x ,) a n' . 
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The last case, Case log*n , makes the assumption 

£ Q(x.) > n/2 - 1 . 
0<rj<2(q-l)+logn 

It is seen that these log*n cases do cover all possibilities 
since we always have 

i 2 ^j<i 1 i 3^ j<i 2 0£j<logm + 2(q-l) 

> Y Q(x.) (because i ^< logn + 2(q-l). ) 
0<;j<:s 

= n - 1 

;> n/(21og*n) + n/(21og*n) + ••• + (n/2 - 1) . 



log*n - 1 

Now, Case k , l«ik£log*n - 1 , are proved in the same way. 
Proof of Case k, !<3cs:ioR*n - 1 

There are two possibilities: 
(I) The maximum of Q(x.), iy.+y'^k » is s n ' /fa* 1 ) 

If only we can show h (n/(q-l)) ^ Vn^ > then the desired result 
will follow from Proposition 1 of Sec. 3.4.1. Indeed, 



h (n'/(q-l)) £ logn - log(q-l) - 1 - log(log*n) - (q-3)log(log*n) 

q 

- 21og((q-l)l) - (q-D 
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> logn - (q-2)log(log*n) - 21og(q!) - q 



= Vl<"> 



Therefore we are done in this case. 



(II) The maximum of Q(x.), i fc+1 ^ j < i fc , is < n' /(q-1). 

In this case, let Q(x. ), ..., Q(x. ) be the q-1 largest 

J l 3 q-l 

weights among Q(x ), je[i k+1 , i fc ) . (note i fc - i fc+1 s= q-1 .) 
Then we must have Q(x. ) :> n' /((q-1) (i, - i, , . ) ) for all l^&Sq-l. 
Denote this last fraction by m . If we can show that i. .+(q-l)h (m)£ 
q*h .-.(n), then the conclusion will follow from Proposition 2 of 
Sec. 3.4.1. Thus, since i fc - i fc+1 £ (q-1) log ( ^n , and n'=n/(21og*n) , 



iij-i + (q- 1 ^ ( m ) s logn + (q-l)log (k+1) n + (q-1) (logn -21og(q-l) 
K+i. q 

- log (k+1) n - 21og((q-l)t) - q - (q-2)log(log*n)) 
£ q(logn - (q-2)log(log*n) - 21og(ql) - q) 

= q* Vi (n) ■ 



This completes the proof of Case k , l^k^log*n - 1. 
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Proof of Case log*n 

In this case we have the assumption 

Y Q(x.) > n/2 - 1 . < 12 > 
(Kj<2(q-l)+logn 

But by Equ.(5) of Sec. 2.3 we have 

EO 1 Tlognl -3 

Q(x.) £ 2 U + 2 + H» + 2 

0^j^riognl-3 2 riognl-2 _ x 

= n/4 - 1 . (13) 

From Equs. (12) and (13), we get 

V Q(x.) > n/4 . 
logn-3<j<logn+2(q-l) 

Hence the maximum of Q(x.), logn-3 < j < logn+2(q-l), must be greater 

than n/8(q+l) . 

As in (I) of the proof of Case k , k < log*n , we need only show that 

h (n/8(q+l)) ^ h , (n) , then the result will follow immediately from 
q q+1 

Proposition 1 of Sec. 3.4.1. The calculation is straightforward: 

h (n/8(q+l)) £ logn - 2 - log(2q+2) - (q-3) log(log*n) - 21og((q-l) j)-(q-l) 

q 

=* logn - (q-2)log(log*n) - 21og(q!) - q 

= Vi (n) - 

(We assume here n > 4 due to the remark following Equation (11).) 
This proves Case log*n , and the proof of Lemma 2 is now complete. 
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CHAPTER 4 
FINDING A MEDIOCRE PIAYER 

4.1 Introduction 

In the preceding chapters we have successfully extended Kislitsyn's 
lower bound to the case when, instead of finding the second best of n 
players, the i-th best is to be determined. We now turn our attention 
to another type of selection problems. 

Given i and j with i + j + 1 £ n, our objective now is to find a 
player who is neither among the i top players nor one of the j worst 
players. We shall say such a player is (i,j) -mediocre. Historically 
this problem is closely connected with the finding of the median ele- 
ment, whose starting point is usually the selection of an element that 
is not too close to either extreme. 

Technically, the selection of the i+lst largest element is a 
special case of this "mediocre player" problem in which n = i+j+1. 
This connection suggests a way of finding an (i,j) -mediocre player. 
We simply pick i+j+1 elements arbitrarily and select the i+lst 
largest among them; it is obvious that this element satisfies the 
"(i,j) -mediocre" requirement. The question that naturally follows is 
"Is this the best algorithm ?" In Sec. 4.3 we will prove that this is 
the case when i=l. We will also show that if the answer to this 
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4.2 Properties of S(i,j,n) 

For i+j+1 £ n, let S(i,j,n) denote the minimum number of com- 
parisons needed to find an (i,j) -mediocre element as defined in Sec. 4.1. 
The following facts are easy to verify: 

(i) S(i,j,n*) <: S(i,j,n) if n'S: n :> i+j+1. 

This is true since from n' elements we can arbitrarily choose 
n elements and in S(i,j,n) comparisons find a desired (i,j)- 
mediocre element. 

(ii) There is an integer iu. £ 12 (i+j+1) such that 
S(i,j,n) = S(i,j,n Q ) for all n 2: n Q . 

Proof By the algorithm of Blum et al[l], it takes less than 
6m comparisons to find the t-th largest of m elements for any 
t<:m. Since S(i,j, i+j+1) = V. +1 ( i+j+1), we thus have 
S(i,j,n) <; S(i,j, i+j+1) <: 6(i+j+l) for all n :> i+j+1 by (i). 
Since an optimal algorithm never consults more than 12 (i+j+1) 
elements, we must have 

S(i,j,n) £ S(i,j,l2(i+j+l)) for n^ 12(i+j+l). 

Thus, for fixed i, j, the function S is non- increasing in n and 
reaches a constant before n is 12 (i+j+1). 
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As a result, we have 

Theorem 4 For n £ i+j+1, S(i,j,n) is independent of n if and only 
if selecting (optimally) the i+lst largest element of an arbitrary 
subset of i+j+1 elements is an optimal procedure for finding an 
(i,j) -mediocre element in n elements. 

Proof If S(i,j,n) is independent of n, then 

S(i,j,n) = S(i,j, i+j+1) = V. +1 (i+j+D . 

This shows that the proposed algorithm is optimal. On the other hand, 
if the algorithm described in the theorem is optimal, we will have 

S(i,j,n) = V i+1 (i+j+l) for all n :> i+j+1 
which shows that S(i,j,n) is independent of n. 
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4.3 S(l,j,n) = V 2 (j+2) 

Because of Theorem 4 of the last section, it becomes an intri- 
quing question whether S(i,j,n) is independent of n or not. In this 
section we will show that S(l,j,n) jLs independent of n. This 
lends support to the conjecture that S(i,j,n) depends only on i and 
j. We will discuss some implications of this conjecture in the next 
section. 

Theorem 5 S(l,j,n) = V 2 (j+2) = j + riog(j+2)1 

Proof It suffices to show that S(l,j,n) 2: j + l~log( j+2)l . For any 
given algorithm that finds a (l,j) -mediocre player, we shall apply 
Basic Strategy (cf. Sec. 2.2) to it and call the resulting tournament 
T. Suppose x is the (l,j) -mediocre player found in T, define 

A = fx I either x=x or x is determined to be inferior to 

1 m 



x by T }. 



m 



By the definition of "(1, j)-mediocre" x must be defeated at least once 



m 



in T; hence none of the guys in A is an undefeated player. Also, let 
B = [y | y is undefeated at the end of T and has weight > 1}. 

Thus, if B = [y^ ..., y }, then y 1 , ..., y r are those players corre- 
sponding to the roots that have at least one descendant (besides 
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itself) in the first-defeat- tree of T. Certainly A il B = 0. 

Claim If for kKr, y has weight m , then we must have 

r 

£ riog(m.)"l s riog(j+2)l . 

k * V 1 

Proof of Claim Suppose for l^J^r, we have 2 < m^ <: 2 

Then, r 

E riog(m )1 = (k- + 1) + ••• + (\+ 1) 
4=1 

= r + (k^ + ••• + k f ) . 

On the other hand, 

k-+l k +1 

1 r 

m, + • • • + m ^2 + • • • + 2 
1 r 

k n +l k +1 

<, 2 ••• 2 r 

(k-+ ••• + k ) 
9 r v 1 r 

= 2 • 1 . 

Hence 

flog^ + ••• + m )1 s r + (kj_ + ••• + k r ). 



(U) 



(15) 



However, since every player in A must be the descendant of some y^ , 
\<.Sbsr, we have 



m 1 + "«+m r ^r + JAjsl + jA|. (16) 



Since by the definition of (i,j) -mediocre we must have |Aj s j+1, 
Equ. (16) becomes 

m, + ••• + m :> j + 2 . (I 7 ) 

1 r 

The Claim then follows from Equs. (14), (15) and (17). 
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Now that Claim has been proved, Theorem 5 follows quite easily. 
Indeed, from Claim and Fact 2 of Sec. 2.2, we know that y^, ..., y 
have played at least riog(j+2)l matches all together. But from the 
definition of A, it follows that at least | A | - 1 matches have been 
played between the players of A (one "crucial" match for each player 

in A other than x ). Since A and B are disjoint, we see that this 

m 

tournament must contain at least 

riog(j+2)l + |A| - Is riog(j+2)l + j 

matches. Thus the proof of Theorem 5 is complete. 
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4.4 Connections with Median Computation 

In the last section we proved that S(l,j,n) is independent 
of n. This by no means implies S(i,j,n) is independent of n 
for other i as well. However, if this should indeed be true, it 
would have the following interesting consequence. 

Proposition If S(i,j,n) is independent of n for all i,j, then one 
can find the median of n elements by using no more than 3n compari- 
sons. 

Proof Let M(n) be the minimum number of comparisons for finding the 
median of n elements. Then, for a given n, to find an (n/2, n/2)-medi- 
ocre element among 3n/2 elements' one oan proceed in two ways: 

1) Pick any n + 1 elements and find their median. This requires 
M(n+1) comparisons in the worst case. 

2) Divide the 3n/2 elements into n/2 triplets and sort each 
triplet. Then take the central elements of all the triplets and 
find their median. This element is easily seen to be a desired 
(n/2, n/2) -mediocre element, and this method requires 

3* (n/2) + M(n/2) comparisons in the worst case. 

Now, if S(i,j,n) is independent of n then the first method is 
optimal by Theorem 4. Therefore we have 



^mmmmmmmmmmmtmmmmmmfKim 
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M(nfl) * 3n/2 •+ M(«IHtjyfaot.» 1 TV-g 
It can be proved by induction that M(n) < 3s. Of eoaree we have 
ignored new fine deteile iaifthtt? 

account in^^coamtnffca^a 9 ^tof^ 



x WiW'" 






: CV ■: -r - --»-:.*.< 



^s^'isfsfpjjj sc jxA s»ilT ;-,£?,;•;, jr.-' 



■;?:* jfjI'iassFslr" ^IsKfaiJ ski* orai.bi;rj lei 
c\€,; .,^'*ei^!a' ' «3iMK»StfA .saris 



'.v,- •' i \ 3«Itrf nlmtisuM s -i.&!»3.;i3.'; \X] 
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